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Abstract 

We investigate the following vertex percolation process. Starting with a ran- 
dom regular graph of constant degree, delete each vertex independently with prob- 
ability p, where p = n~ a and a = a(n) is bounded away from 0. We show that 
a.a.s. the resulting graph has a connected component of size n — o(n) which is an 
expander, and all other components are trees of bounded size. Sharper results are 
obtained with extra conditions on a. These results have an application to the cost 
of repairing a certain peer-to-peer network after random failures of nodes. 

1 Introduction 

In this paper we investigate the effect of randomly deleting some vertices in a random 
regular graph. Take a random (i-regular graph G on n vertices and independently delete 
each vertex with probability p. The result is a random graph G with maximum degree 
at most d. We analyse the structure of G, with particular focus on whether (the largest 
connected component of) G is an expander graph. Here d is fixed, n tends to infinity 
such that dn is even, and we take p = n~ a for some function a = a(n). In this paper 
we treat only the case where a is bounded away from 0, since otherwise even the largest 
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component of the graph is not an expander. Our work is motivated by an application 
in peer-to-peer networks, as described below. 

In Section [LT] we describe our main result. Related work is described in Section [L2l 
The application to a certain peer-to-peer network is explained in Section 11.31 Our 
calculations will be carried out in the configuration model which is described in Section |2l 
Then our calculations are presented in Section [3j 

1.1 Notation, terminology and our main result 

There are several related definitions of expander graphs. We will say that a graph G on 
n vertices is a (3-expander if every set S of s < n/2 vertices has at least (3s neighbours 
outside S. An alternative definition involves d(S), the sum of the degrees of vertices in 
S, and e(S), the number of edges leading out of S, and defines G to be an 7-expander if 
e{S) > ld(S) for all sets S C V(G) of vertices with d(S) < \E(G)\. For bounded-degree 
graphs these give equivalent notions of expanders, up to a constant factor in translating 
7 to p. 

In this paper, all asymptotics are as n — > 00. We say that an event holds asymp- 
totically almost surely (a.a.s.) if the probability that it holds tends to 1. We adapt the 
standard O(-), o(-) notation to accommodate versions which hold a.a.s., following [TJJ 
Section 8.2.1]. Specifically, let f(n), g(n) and <p(n) be functions such that |/| < <pg. If 
4>(ri) is bounded for sufficiently large n then we write / = 0(g), and if — > as n — > 00 
then we write / = o(g). When f/g = 1 + o(l) then we write / ~ g and say that / 
and g are asymptotically equal. If a statement S about random variables involves the 
notations O(-) or o(-) then S is not an event, and we define "a.a.s. 5"' to mean that all 
inequalities of the form |/| < <pg which are implicit in S hold a.a.s.. 

Let Q n ^d denote the uniform probability space of all (simple) <i-regular graphs on the 
vertex set [n] = {1, . . . , n}. Our main result is the following. 

Theorem 1. Fix d > 3 and a constant rj > 0. Suppose that a = a(n) satisfies 

a(n) > rj (1) 

for n sufficiently large. Let G G Q n ^ and let G be the graph obtained by independently 
deleting vertices of G independently with probability n~ a . Then 

(a) there is a constant (3 > such that a.a.s. G has a connected component of size 
n — o(n) that is a (3-expander, and all other components are trees of bounded size; 

(b) ifr}> 2priy then there is a constant (3 > such that a.a.s. G consists of a connected 

component that is a (3- expander, together with o(n^ 2 ^ ( - 2d ^ 2 ^) isolated vertices; 

(c) if t] > -Ar then then there is a constant (3 > such that a.a.s. G is a (3-expander. 

The result in (a) is best possible, in the sense that if a goes to in the deletion 
probability n~ a then there is no fixed positive expansion rate: that is, there is no fixed 
(3 > as stated in the theorem. The reason for this is as follows. It can be shown by 
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the second moment method that if k < l/(a(d — 2)) then there are a.a.s. many paths 
of degree 2 vertices of length at least k in the large connected component. Any one 
such path causes the expansion rate to be at most at most 2/(k — 1). This is explained 
further after Lemma [5] below. 

1.2 Related work 

While the vertex-deletion process which we analyse in this paper does not seem to 
appear in the literature, there are various papers [7J El [H] investigating the result 
of deleting edges of random regular graphs independently with some given probability. 
This is usually described as edge percolation, and the resulting graph is sometimes called 
the faulty graph. These papers are also motivated by applications to communications 
networks. Nikoletseas et al. [11] focus on the connectivity properties of the faulty graph, 
and undertake a study somewhat similar to ours. Goerdt [H Theorem 2] proves that for 
small constant edge deletion probability, there is a linear-sized component of the faulty 
graph. However, it is not an expander. Goerdt and Molloy [8] extend this analysis 
to give a threshold on the fault probability for the existence of a linear sized fc-core 
whenever 3 < k < al. (The fc-core of a graph is the unique maximal subgraph in which 
each vertex has degree at least k, see for example |4, p. 150].) The k-core is with 
high probability an expander, but only contains some proportion of the vertices. These 
results are considering much higher deletion probabilities than we do in the present 
paper, because they tolerate a very large number of disconnected vertices: linear in n. 

The paper of Alon et al. pQ considers edge percolation on expander graphs, which 
includes random regular graphs of degree at least 3. (Though they consider graphs 
of high girth, this is a minor detail.) They determine the threshold at which a giant 
component exists. They also give a result p., Proposition 5.1] on the expansion of the 
giant component when the edge deletion probability tends to 0. This involves (1/ logn) 
expansion however, not constant rate expansion. For random regular graphs, Pittel [12] 
gave a more detailed analysis and determined the order of the transition window of 
appearance of a giant component in a random regular graph under edge percolation. 

1.3 Application to a peer-to-peer network 

The vertex deletion process which we study in this paper is motivated by an application 
to a peer-to-peer network proposed by Bourassa and Holt [HIE]. This network, called the 
Swan network, is based on random regular graphs. Under normal operating conditions, 
the network is given by a ci-regular graph, where al > 4 is an even constant (in practice 
d — 4). Bourassa and Holt claimed that their networks quickly acquire some desirable 
characteristics of uniformly distributed random regular graphs, such as high connectivity 
and logarithmic diameter. (Note that random <i-regular graphs s. expander 

graphs for d > 3 [3], and as such they are connected and have logarithmic diameter. 
Specifically, it is well known and easy to see that if a graph G is a 7-expander then G 
has diameter which is bounded above by log 1+7 (n/2).) 
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Cooper, Dyer and Greenhill [6] gave theoretical support to these claims by defining 
a Markov chain to model the behaviour of the Swan networks. They showed that 
under certain natural assumptions about arrival and departure rates, and with a slight 
alteration of the mechanism of departure, the Markov chain converges rapidly to its 
stationary distribution, which is uniform when conditioned on a fixed number of vertices. 
While random ci-regular graphs s. connected for d > 3 (indeed, c/-connected), a 

Swan network in the absence of departures is always connected. 

In the context of peer-to-peer Swan networks, the random deletion of a vertex corre- 
sponds to a client failing. Edges correspond to LAN or Internet connections and so are 
far more robust. Individual clients fail due to lost power, shut down or logoff events, 
frozen applications, and similar phenomena. Hence our exclusive consideration of vertex 
deletions, rather than edge deletions. 

Swan networks are self-administering. In particular, they are self-healing after the 
loss of some vertices, completing a <i-regular graph among the remaining vertices. For 
Swan networks, there are two processes to handle lost neighbours: an inexpensive process 
that uses messages internal to the graph, and a more expensive process that contacts 
vertices using messages external to the graph. As long as the graph remains connected, 
the repairs can safely use the internal repair mechanism. Hence for this application it is 
desirable that the majority of clients in the network remain in a connected component. 

Theorem [1] models this situation and shows that the large connected component is 
an expander, which has three important implications for Swan networks. First, under 
certain constraints on the probability of node failures, Swan networks tend to remain 
connected under the simultaneous loss of several nodes. Second, deletions do not degrade 
the log-diameter of the Swan networks. Finally, Theorem [1] implies that the current 
internal repair strategies could be modified, efficiently involving more of the remaining 
nodes in the repair. 

2 The configuration model and some definitions 

As is usual in this area, calculations are performed in the configuration model (or pairing 
model), see for example [13] or [TUl Chapter 9]. A configuration consists of n buckets 
with d points each, and a perfect matching of the dn points chosen uniformly at ran- 
dom. The edges of the perfect matching are called pairs. Assume that the buckets are 
labelled 1, . . . , n and that within each bucket the points are labelled 1, . . . , d. Denote 
this probability space by V n ^- Given a configuration P e V nt d we obtain a pseudograph 
G(P) by shrinking each bucket down to a vertex. This pseudograph may have loops 
and/or multiple edges, but the probability that it is simple (with no loops or multiple 
edges) is bounded below by a constant. Moreover, conditioned on G(P) being simple, 
it is uniformly distributed. 

Similarly if d = . . . , d n ) is the degree sequence of a graph, then P n d denotes the 
configuration model where the jth bucket contains dj points, and a perfect matching 
of the 2m = Y^=i dj points is chosen uniformly at random. Here we assume that the 
buckets are labelled 1, . . . , n and that the points in the jth bucket are labelled 1, . . . , dj. 
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We can now define the bucket deletion process for configurations. For the remainder 
of the paper, assume that ([T]) holds for some positive constant 77, for n sufficiently large. 
Given P e V n ,d, form a new configuration P by independently deleting each bucket with 
probability p. Specifically: 

• choose a random subset R of buckets such that b e R with probability p = n~ a , 
independently for each bucket b, 

• delete all buckets in R, 

• delete every pair with an endpoint in a bucket in R, together with the other 
endpoint of the pair if it lies outside R, 

• relabel the surviving buckets with the labels 1,2,.. ., preserving the relative or- 
dering of the buckets, 

• relabel the points within each surviving bucket in the same way. 

Note that the same distribution on P will result if the set R of buckets to delete is 
chosen first, and then P e V n ,d is selected. 

We now give some definitions which we will need. A connected component of a graph 
which is a tree will be called an isolated tree, and a connected component of a graph 
which is a cycle will be called an isolated cycle. 

The 2-core of a graph G, denoted by cr(G), is obtained from G by the following 
process: let Gq = G and for t > 0, if G t contains a vertex v of degree or 1 then let 
Gt+i = Gt — v, otherwise stop. The final graph is cr(G). From the 2-core cr(G) of G we 
obtain the kernel of G, denoted by ker(G), by suppressing all vertices of degree 2. That 
is, if v is a vertex of degree 2 in G' with neighbours {a, b} then delete v and replace 
these two edges by the edge {a, b}. 

Given a graph G, an edge of G is a cyclic edge if it belongs to a cycle, or to a path 
joining two cycles. The cyclic edges are precisely those of the 2-core. The subgraph of 
G induced by the non-cyclic edges is a union of some number of components. We call 
each of these components a bush. If a bush B has a vertex which is incident with at 
least one cyclic edge of G then this vertex is called the root of B. Following from these 
definitions, a bush can have at most one root, and the bushes are pairwise disjoint. 

We will say that a configuration P has some property if the corresponding graph 
G(P) has that property. This allows us to speak of paths and cycles in a configuration 
P, as well as subconfigurations of P which are trees, bushes and so on. In particular we 
can define the 2-core and kernel of a configuration. 

We will need the following lemma which has a very straightforward proof and can 
be found in [U p. 54]. 

Lemma 1. Let k be a fixed positive integer and let d = (di, . . . , d n ) be a degree sequence 
satisfying < dj < d for all i. Then the probability that a random element of V n ^ 
contains k specified pairs is (1 + o(l))(2m)~ fc , where m = (di + - ■ ■ + d n )/2 is the number 
of pairs in the configuration. 
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If an event is a.a.s. true for G{P) when P G V n ,d, then it is also a.a.s. true conditional 
on the event that G(P) is simple. This comes immediately from the fact that the 
probability that G(P) is simple for P G V n .d is bounded below by a nonzero constant 
(see for example pfl, p. 55]). This is the way that many results about Q n ^ have been 
proved using V n ^- 



3 The details 

Let P G V n ,d and let R be the random set of buckets chosen for deletion. Write r = \R\. 
By the well known sharp concentration of binomials, since a is bounded away from 0, 

cL. ct.S. 

r ~ n 1 -* (2) 

provided n 1 ^ — ► oo. Until we come to the proof of Theorem [1] we will assume that the 
latter condition holds, so that ([2]) holds. The other case is easily handled afterwards. 

Let P be the result of deleting the buckets in R from P (and performing the necessary 
relabellings of buckets and points). Then P has n — r buckets. Let dj denote the 
number of points in the jth bucket of P, and say that bucket j has degree dj. Thus 
< dj < d. The the degree sequence of P is (di, . . . , d n _ r ) and number of pairs in P is 
(g?i + . . . + d n _ r )/2. Let Nj be the number of buckets of P with degree j, for < j < d. 
The following result shows that we can use V n ^ to model P, conditional upon it having 
degree sequence d. 

Lemma 2. The pairing P is uniformly random conditioned on its degree sequence d = 

(di, . . . , d n - r ). 

Proof. First notice that the set R determines an injection (p : [n — r] [n] which is the 
inverse of the relabelling operation performed when P is constructed. The probability 
of a particular P with degree sequence d = (d\, . . . , d n - r ) is given by 

N 2 l\V n4 \ 




where 



(") is the number of order- preserving injections ip : [n — r] — > [n] , giving the labels 
of the buckets from P in P, 

(^) is the number of order-preserving injections from [dj] to [d], giving the labels 
of the points from bucket j of P in bucket <p(j) of P, 

n~ ar is the probability that the r buckets of P which do not correspond to buckets 
of P are deleted, 
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d = (di, . . . , d n ) is the degree sequence given by 

di - 



d if i £ cp([n — r]), 

d-dj ifi = p(j), 



• iVg is the number of configurations with degree sequence d, giving the number of 
ways to complete the configuration P. 

Since the above expression depends only on d and not on the particular structure of P, 
it follows that P is uniformly random conditioned on its degree sequence d. □ 

For < j < d let 

Lemma 3. Assume that r satisfies (dp. Form P from P E V n ,d by deleting the buckets 
in R as described in Section^ Then, for < j < d, we have ENj ~ fij and a.a.s. 

Nj^fij if Hj-^oo, 

iV, = 0(loglogn) 1/^ = 0(1), (3) 
.V, ^Mi = o(l). 

In all cases, a.a.s. Nj = o(^) for < j < £ < d. 

Proof. Fix j G {0, . . . ,d}. Choose a random configuration P e V n ^- The probability 
that a given bucket b ^ R is incident with exactly d — j pairs which are incident with 
points in R is asymptotically equal to 

(The first factor chooses d — j points in b and the second factor chooses d — j points in R. 
There are (d — j)\ ways to match up these points using pairs, and the probability that a 
random element of V n ,d contains these pairs is (dn)~^ d ~^\ by Lemma [TJ) Therefore by 
linearity of expectation, 

ENj ~ fij, 

proving the first statement. 

Now suppose that fij — > oo. Similar calculations for an ordered pair of buckets 
b,c ^ R show that a.a.s. E[A/j] 2 ~ (ENj) 2 . This establishes the sharp concentration of 
Nj whenever \ij — > oo. The other two statements in ([3]) follow from Markov's inequality, 
as does the final statement of the lemma. □ 

Now fix a positive integer K such that 

2 

K > 



(d-2)77 5 
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where rj is the constant from ([TJ). Recall the definition of a bush given before the 
statement of Lemma 1. Note that if a bush B in P is not an isolated tree then it has a 
root, and for each non-root bucket v of B, the degree of v in P is the same as in P. 

Lemma 4. Let d = (di, d%, . . . , d n - r ) be a degree sequence such that < d{ < d for all 
i, r satisfies $M), and Nj satisfies (TJ|) for < j < d. Let P G V n ,d- Then a.a.s. P has 
no bushes with more than K buckets (including isolated trees). 

Proof. First observe that every tree on k > 2 buckets has at least k/2 + 1 buckets of 
degree 1 or 2. (This can be proved using induction.) Suppose that P contains a bush 
B with more than K + 1 buckets. Then P contains a bush with exactly k + 1 buckets, 
for some k between K < k < 2K. To see this, suppose that B has more than 2K + 2 
buckets. Let b be any bucket of B if it is an isolated tree, or let b be the root bucket of 
B otherwise. Then at least one neighbour of b, say b', is the root of a (smaller) bush B' 
in P with more than K buckets. By induction on B', the result follows. 

So now let B be a bush with k + 1 vertices, where K < k < 2K, and let S be the 
set of buckets in B. Ignoring the root bucket (which may have higher degree in P than 
it does in B), it follows that there are at least k/2 buckets in S with degree 1 or 2 in 
P. Moreover there are k pairs in P between points in buckets of S. 

Now we prove that a.a.s. there are no such sets 5* of buckets in a random element of 
V n ,d- There are N\ + N 2 buckets in P of degree 1 or 2, and by ([3]), a.a.s. 



0(/i 2 ) if /i2 — > oo or /i 2 = o(l), 

O(loglogn) if// 2 = 0(l). 

(Here if fi 2 —> and /ii = 0(1) then we use the fact that a.a.s. Ni = 0(/i2), rather 
than the arbitrary upper bound of O(loglogn) from ([3]).) Hence there are a.a.s. at most 

0(l)n k l 2+l g(n) k l 2 

ways to choose the buckets belonging to the set S, where 

f fi 2 = n ( 1 -( d - 2 >«> if / u 2 -^ooor/i 2 = o(l), 
g[n) = < (4) 
[log log n if/i 2 = 0(l). 

There are 0(1) ways to choose locations for the k pairs between points of S, and the 
probability that a random element of V n ^ contains these pairs is 0(n~ k ), by Lemma [TJ 
Therefore the expected number of such sets S in P is a.a.s. 

0{l)n k l 2+l g{n) k ' 2 n- k . 

This is clearly o(l) if <?(n) = log log n, and otherwise 

0(l)n k l 2+1 g(n) k ' 2 n- k = 0(^-2)^/2) 

= o(l) 

by choice of K. Hence by Markov's inequality in either case there are a.a.s. no such sets 
S, for K < k < 2K. The lemma follows. □ 
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To create the 2-core cr(P) of P, start with P and delete all buckets of degree 0. Then 
while any buckets of degree 1 remain, delete one at each time step until none remain. 
Finally, relabel the remaining buckets and the points within the remaining buckets, 
respecting the relative ordering. This process is equivalent to deleting all isolated trees 
and "pruning" all bushes of P (where pruning involves deleting all buckets of the bush 
except the root, and deleting all pairs incident with any non-root bucket of the bush), 
followed by relabelling. Denote the number of buckets in cr(P) by t and let d' = 
{d' x , . . . , d' t ) be the degree sequence of cr(P). This defines iVj, the number of buckets in 
cr(P) with degree j, for 2 < j < d (since cr(P) has no buckets of degree or 1). 

Lemma 5. Let d = (dx,...,d r ) be as in Lemma^ Let P 6 V n ,d- Then the 2-core 
cr(P) of P has the following properties: 

(i) a.a.s. t ~ n — r and N'- ~ Nj for 2 < j < d, 
(ii) cr(P) is uniformly random conditioned on its degree sequence, 
(Hi) a.a.s. cr(P) has no isolated cycles, 

(iv) a.a.s. cr(P) has no paths of length at least K + 1 where all internal vertices have 
degree 2. 

Proof. By Lemma HI a.a.s. all bushes and isolated trees in P have at most K buckets 
(including the root). Hence the total number of buckets of P contained in bushes is 
a.a.s. 0(/ii) unless lim^oo/ix = 0(1) and fi x ^ o(l), in which case an upper bound is 
given by O(loglogn). Note also that by LemmaE], a.a.s. 

Nx = o(/i^) and fid ~ n — r. 

It follows that cr(P) has t buckets where a.a.s. 

t = n — r — o(n — r) ~ n — r. 

By Lemma [3] again it follows that a.a.s. N'- ~ Nj for 2 < j < d. This proves (i). 

The proof of (ii) is similar to the argument given in the proof of Lemma [3] and for 
similar statements in papers on cores of random graphs, so we do not include it here. 

Let m = (d[ + ■ ■ ■ + d' t )/2 be the number of pairs in cr(P). By (ii) we know that, 
conditioned on having degree sequence d', cr(P) has the distribution of V n ^'- Using 
this and Lemma HJ the expected number of isolated fc-cycles in cr(P) is at most 

0(1) Cf) (k-l)\(2mr k = 0(l) (f 



for 2 < k < t. Therefore the expected number of isolated cycles in cr(P) is at most 



m J 1 — N' 2 /m 
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and since a.a.s. N' 2 = o(m) by Lemma El we see that a.a.s. the expected number of 
isolated cycles in cr(P) is o(l). This establishes (hi), by Markov's inequality 

Finally, the expected number of paths in cr(P) of length K + 1 with K internal 
buckets of degree 2 is at most 

0(1) t 2 K\ (2m)-( x+1 ) = 0(n) 

using Lemma [Hand (i). Using (Ej), a.a.s. this expression is 

0{n x - K )g{n) K 

where g(n) is defined in (00). Using calculations as in Lemma HI this bound is o(l). 
Applying Markov's inequality establishes (iv) and completes the proof. □ 

Following the calculations in this proof, we can now see why the result of Theorem Q] 
is best possible, in the sense outlined in the introduction. Suppose that p = rr e where 
e > may be arbitrarily small. Choose a positive integer k > 2 such that e(d — 2)k < 1. 
(By choosing small enough e we may choose k to be arbitrarily large.) With this deletion 
probability we have 

A*2 > (^j nl ~ 1/k ~* °°. 

so the expected number of paths in cr(P) with length at least k and with at least k — 1 
internal vertices of degree 2 is at least 

which tends to infinity. Standard variance calculations show that the number of such 
paths is sharply concentrated, so there is a.a.s. at least one such path in cr(P). This 
implies that the expansion constant of cr(P) is at most 2/(k — 1). Conditioning on 
the event that G(P) is simple, we have the same conclusion (see the end of Section [2]). 
Hence when p = n~ e where e = o(l), we may take k — > oo, there is no fixed positive 
expansion rate (3, and the conclusion of Theorem [1] does not hold. 

For practical applications such as the Swan networks, a constant but very small 
deletion probability is the most natural assumption. For the range of n of interest in 
the applications, the probability would be at most n~ £ for some small positive constant 
e that is not extremely small. For values of the parameters determined in this way, we 
would expect the asymptotic trends studied in this paper to be accurate. 

Proof of Theorem [H Fix a positive integer d > 3 and a constant rj > such that (PQ) 
holds for n sufficiently large. Let P G V n ^ and form P from P as described in Lemma [21 
We first treat the case that n 1_Q — > oo, so that (j25 holds a.a.s., and we prove the 
conclusions of the theorem for the multigraph G(P). Only at the end do we remove this 
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assumption and translate the result to G G Q n ,d- We have by Lemma [21 P e V nt d where 
d is the degree sequence after deletion. Let cr(P) be the 2-core of P. Then a.a.s. the 
conclusion of Lemmas El HI E all hold. 

Condition on the event that all these conclusions hold, and let ker(P) be the kernel 
of P. Then ker(P) is obtained from cr(P) by suppressing the degree-2 buckets. That 
is, if b = {pi,p2} is a degree-2 bucket in cr(P) involved in pairs {pi,x}, {p2,y}, then 
delete b, remove these pairs and add the pair {x, y}. Since cr(P) has no isolated cycles, 
ker(P) has exactly iVj buckets of degree j for 3 < j < d (and no buckets of degree less 
than 3). For the reasons given in Lemma [5] (ii), we omit the arguments that show that 
ker(P) is uniformly random conditioned on its degree sequence. Let H = G(ker(P)) 
be the multigraph obtained from ker(P) by shrinking buckets to vertices and replacing 
pairs by edges. From (2j Lemma 5.3], for some constant 5 > the multigraph H is a.a.s. 
a 5-expander. (This is well known: for example, a version of this is mentioned in [7] 
without proof.) The constant S depends only on t]. At this point we further condition 
on this asymptotically almost sure expansion event holding. 

Let G(P) be the multigraph corresponding to the pairing P. We obtain G(P) from 
H by performing the following steps: 

• replace some edges by paths of length at most K, 

• glue on some bushes of size at most K by identifying their roots with distinct 
vertices, 

• introduce some isolated trees of size at most K, 

• perform the appropriate relabellings of vertices. 

Since we are conditioning on the event that the conclusions of Lemma [3] hold, G(P) 
consists of O(Ni) = o(n) vertices in isolated trees of size at most K, together with a 
large component having n — 0{N\) vertices. Let U be the large component, and let 
u = \U\. Note that \H\ = u — o(u). We now show that U is an expander. 

Fix any subset S C V(U) with \S\ < u/2. By an object we mean any bush which 
has been added to H, or path replacing an edge of H, or edge of H not replaced by a 
path, in the process of creating U from H . An object includes the vertex or vertices of 
H where it is attached. An object is partially occupied if it has some vertices in S and 
some not in S, and it is fully occupied if all its vertices belong to S. 

First suppose that there are at most e\S\/K partially occupied objects, where e > 
is a constant. Then at most e\S\ vertices of S are in partially occupied objects. For 
each fully occupied object there are at most K — 1 vertices not in H and at least one 
vertex in H. Each of these vertices is involved in at most d objects, so the number of 
vertices in V(H) n S is at least \/{d(K — 1) + 1) > 1/dK times the number of vertices 
in fully occupied objects. Since all vertices of S are in either partially or fully occupied 
objects, it now follows that 

\v(B)ns\>^-\s\. 



n 



Let A be the set of vertices in V (H) \S which have a neighbour in V(H) C\S. We claim 
that there exists a constant 7 > such that \A\ > 7 \V(H) n S\. If \V(H) n S\ < \H\/2 
then the claim follows immediately because if is a 5-expander. So we may assume that 
\V(H) DS\> \H\J2. Then \B\ < \H\/2, where B = V(H) \(SU A), so the expansion 
of H implies that \A\ > S\B\ and hence 

1 1 - 1 + 5 1 1 1 + 5 1 v 7 x 1 ~ 1 + 5 11 

as I if I ~ u and j^l < w/2. Thus, the claim holds with 7 = 5/(3 + 35). The claim implies 
that there are at least 

i\v{H)ns\> 1 -^ ! fi\s\ 

partially occupied objects. 

So we may suppose that for some e > 0, there are more than e\S\/K partially occu- 
pied objects. Each partially occupied object contains an element of S with a neighbour 
in V(U) \ S. Since each vertex in U has degree at most d, each of these neighbours can 
be incident with at most d partially occupied objects. Therefore S has at least e\S\/dK 
neighbours outside S. It follows from this that the large component U is a constant rate 
expander, under our assumptions. The conclusion of part (a) of the theorem now follows 
for the initial random multigraph G(P) in place of G G G n ,d: under the assumption that 

l—a 

n — > 00. 

For (b), we need to show further that a.a.s. the only isolated trees in P are isolated 
vertices. By Lemma [31 the expected number of isolated trees with at least two leaves 
and k — 2 other vertices is 

n *-2 n 2(l-(rf-l)«) q („-(*-!)) = 0(n l-2(rf-l)«) = o(1) . 

Hence the isolated trees s. isolated vertices, as required. Also, the number of 

isolated vertices is iVo and if no — >• 00 then a.a.s. 

N ~ f x = n 1 - ad = o(n 1 - d / ( - 2d -V) 

for the given bound on r]. In the other cases we still have No = o(n( d ~ 2 ^( 2d ~ 2 ^) a.a.s., 
using Lemma [3l 

For (c), the conclusion of (b) still applies, but in addition, in this case EiVx = o(l). 
So there are a.a.s. no isolated trees or bushes of any size, and the conclusion of (c) 
follows for the multigraph G(P). 

This completes the proof of the theorem except for two aspects. First, we transfer 
the conclusions from the initial random multigraph G(P) to G G G n ,d- This is done by 
conditioning on the event that G(P) is simple. As explained at the end of Section [21 
the truth of these asymptotically almost sure results is not affected. In the conditional 
space, G(P) becomes G as in the statement of Theorem [TJ 

Finally, we only need to dispense with the assumption that n 1_a — > 00. Assume that 
n l ~ a = 0(1). We may apply the version of (c) already proved, to conclude that the 
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deletion of vertices from G £ Q n ^ with probability n~ 3 / 4 a.a.s. produces a /3-expander 
G' . If we then reinstate each deleted vertex (and incident edges) independently with 
probability 1 — n 3 / 4 ~ Q (noting this is positive for n sufficiently large) then the result 
is the same as deleting each vertex of the original graph with probability n~ a , that is, 
it produces G. The vertices deleted from G to produce G' are, by easy first moment 
considerations, a.a.s. of distance at least 3 from each other. In this case, reinstating them 
cannot create any new components, and it is easy to see that after reinstating them, the 
resulting graph G is a (/3/2)-expander when f3 < 2. The only nontrivial case is when 
S C V(G) satisfies \S — W\ < \S D W\, where W is the set of reinstated vertices. Here 
each vertex of S n W has d neighbours outside W, giving d\S n W\ distinct neighbours 
of S fl W outside W. At most \S — W\ of these can lie in S, so S has at least 

d\snw\ -\s-w\> (d-i)\snw\ > ^^\snw\ 

neighbours outside S in G. This gives /9/2-expansion when f3 <2. □ 
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